Overlapping Constraints of Two Step Selection to Generate a Transfer Dictionary

نویسندگان

  • Satoshi Shirai
  • Kazuhide Yamamoto
  • Kyonghee Paik
چکیده

Any machine translation system requires a transfer dictionary between the source and target languages. Typically, since the construction of such a dictionary is done by hand, a lot of time is taken and the cost is enormous. Considering this, we attempted the construction of a bilingual dictionary through the re-generation of already-existing language resources. Aiming at the generation of a KoreanJapanese dictionary, we extracted candidates of Korean and Japanese equivalent pairs by a two-step process of searching through a Korean-English dictionary rst and then searching through an EnglishJapanese dictionary. We also attempted the narrowing down of Korean-Japanese equivalent pairs by the overlapping of obtained Japanese translations. According to a trial experiment using 100 Korean words randomly taken, 61 correct Japanese translations were obtained. Among the correct translations, we took 25 translations for which a search of the English-Japanese dictionary successfully produced two or more translations for the English words obtained in the search results of the Korean-English dictionary. Of the 25 translations, 21 (84%) could be automatically narrowed down by taking the overlapped words from the Japanese translation sets for the individual English words. With the above twostep dictionary extraction, moreover, nine cases out of ten were correct when only one Japanese translation was obtained. These results show the possibility that Korean-Japanese translation pairs can be generated at an expected correctness rate of 44 out of 100 words when using the already proposed method that combines a Korean-English dictionary and a Japanese-English dictionary.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Generation of Helper Plasmids Encoding Mutant Adeno-associated Virus Type 2 Capsid Proteins with Increased Resistance against Proteasomal Degradation

  Objective(s): Adeno-associated virus type 2 (AAV2) vectors are widely used for both experimental and clinical gene therapy. A recent research has shown that the performance of these vectors can be greatly improved by substitution of specific surface-exposed tyrosine residues with phenylalanines. In this study, a fast and simple method is presented to generate AAV2 vector helper plasmids encod...

متن کامل

A Novel Image Denoising Method Based on Incoherent Dictionary Learning and Domain Adaptation Technique

In this paper, a new method for image denoising based on incoherent dictionary learning and domain transfer technique is proposed. The idea of using sparse representation concept is one of the most interesting areas for researchers. The goal of sparse coding is to approximately model the input data as a weighted linear combination of a small number of basis vectors. Two characteristics should b...

متن کامل

Speech Enhancement using Adaptive Data-Based Dictionary Learning

In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...

متن کامل

Nurses' attitude toward supportive work climate affecting transfer of learning to job

Introduction: Transfer of training has been defined as the application of new knowledge, skills, and attitudes learned from continuing education programs to the job. Learning transfer can be influenced by many factors that can facilitate or hinder it. A supportive work climate is crucial for successful transfer of learning to job. The aim of this study was to evaluate the nurses' attitude towar...

متن کامل

A Sparse Representation Method to Detect Saffron Agricultural Lands Using Sentinel-II Satellite Images Time

Nowadays, agricultural management via remote sensing technology has gained a special position among managers and the people who are in charge of this industry. Saffron (Red Gold) is one of specific Iran’s agricultural products with a high economic valance which is used in different fields of food and medical industries. Considering the cultivation conditions of the saffron, there has not a pers...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001